On the properties of information gathering in quantum and classical measurements 
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The information provided by a classical measurement is unambiguously determined by the mutual 
information between the output results and the measured quantity. However, quantum mechanically 
there are at least two notions of information gathering which can be considered, one characteriz- 
ing the information provided about the initial preparation, useful in communication, and the other 
characterizing the information about the final state, useful in state-preparation and control. Here 
we are interested in understanding the properties of these measures, and the information gathering 
capacities of quantum and classical measurements. We provide a partial answer to the question 'in 
what sense does information gain increase with initial uncertainty?' by showing that, for classical 
and quantum measurements which are symmetric with respect to reversible transformations of the 
state space, the information gain regarding the initial state always increases with the observer's 
initial uncertainty. In addition, we calculate the capacity of all unitarily covariant and commutative 
permutation-symmetric measurements for obtaining classical information. While it is the von Neu- 
mann entropy of the effects which appears in the latter capacity, it is the subentropy which appears 
in the expression for the former. 

PACS numbers: 03.67.-a,03.65.Ta,89.70.-f c,02.50.Tt 



I. INTRODUCTION 

In this article we will be concerned with, among other 
things, the question 'what is the capacity of a given mea- 
surement to provide information?' In considering this 
question, it is important to note that for quantum mea- 
surements the answer is not unique, and depends on what 
the information is about. Thus, we will also be concerned 
with the question of how information gain can be quan- 
tified, and in doing so we will present two kinds of in- 
formation gain useful for different applications. Finally, 
for a given measure of the information gain, we will be 
interested in asking what properties it has, in particular 
how it depends upon the observer's initial uncertainty. 

Since we will be concerned with both classical and 
quantum measurements, we begin in Section ^ by dis- 
cussing the relationship between the two, and in partic- 
ular how classical measurements may be described as a 
subset of quantum measurements, providing us with a 
unified description of both. In Section IIIII we describe 
two physically motivated operations which can be used 
to combine measurements, and introduce some notation. 
In Section HVl we discuss two kinds of information gath- 
ering, information about the initial ensemble, and infor- 
mation about the final state, and consider some ques- 
tions regarding their respective properties under opera- 
tions on measurements. In Section we pause to con- 
sider quantifying disturbance, and discuss how one can 
define two measures of disturbance corresponding loosely 
to the two notions of information gain considered in the 
Section IIVI In Section IVII we consider the information 
gain regarding the initial preparation, and show that 
there is a precise sense in which it can be said to in- 
crease with the observer's initial uncertainty. This moti- 
vates the definition of two classes of measurements, the 
permutation-symmetric measurements and the unitarily 



covariant measurements. Finally, we calculate the capac- 
ity of these classes to obtain information about the initial 
preparation. 



II. CLASSICAL AND QUANTUM 
MEASUREMENTS 

The purpose of this section is to describe the relation- 
ship between classical and quantum measurements, and 
discuss the special role of bare quantum measurements, 
the class with which we will be primarily concerned 
throughout. Quantum measurements 0,01 are described 
by sets of operators, {l^n}, which satisfy X^n^n^" — 1- 
For efficient measurements each operator in the set cor- 
responds to a possible outcome of the measurement 
(Inefficient measurements, on the other hand, are merely 
efficient measurements in which the result is not known 
with certainty - however, we will only be concerned with 
efficient measurements in what follows). Due to the po- 
lar decomposition theorem we can write each of the 
operators il„ as a product of a unitary and a positive 
operator. As a result, the state following a measurement 
may be written as 



Pn 



(1) 



where J7„ is a unitary, and Qn is positive. So long as 
the observer has the necessary Hamiltonian resources, 
she can always perform any unitary operation after the 
measurement, and conditional upon the result. Such an 
action is often referred to as feedback 0,IEI3|- Because of 
this, we can conclude two things from the above expres- 
sion for the final state resulting from the outcome n. The 
first is that, using unitary feedback, one can simulate any 
quantum measurement by making a measurement whose 
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operators r2„ are purely positive. The second is the con- 
verse; that, given any measurement, one can always use 
unitary feedback to simulate a measurement in which the 
operators are purely positive. However, there is a deeper 
significance to the polar form given in Eq.(^, but to see 
this we must turn to classical measurements. 

Classical measurement theory, which existed, natu- 
rally, long before quantum theory, describes measure- 
ments performed on classical objects and derives directly 
from Bayesian statistical inference. 0, 1^, While 
classical measurements are not usually written using the 
same formalism as quantum measurements, they can be, 
and this is to be expected because quantum measure- 
ments must reduce to classical measurements under cer- 
tain conditions; quantum measurement theory reduces 
to the theory of Bayesian statistical inference when both 
the density matrix and the measurement operators are 
all diagonal in the same basis. 

To describe classical measurements with the same for- 
malism as quantum measurements, one writes the ob- 
server's initial classical state of knowledge (being a prob- 
ability density) about a classical variable x, as a positive 
diagonal matrix, pc- Once one has done this, a classical 
measurement may be described by a set of positive diag- 
onal operators P„, such that = 1. The state that 
results from the n*'' outcome is 



Pn = 



PnpPn 

Tr[P,?p]- 



(2) 



For an explanation of how this form is derived from the 
theo ry o f Bayesian inference, the reader is referred to 
Ref. [ij. Thus, classical measurements are described by 
a subset of quantum POVM's, in which all the operators 
must be positive, mutually commuting and commuting 
with the 'density operator' describing the initial classical 
state of knowledge of the observer. 

However, it is useful to note that we can add to the 
classical formalism deterministic transformations. In the 
classical case, the only deterministic operations which 
can be performed are permutations of the states of the 
random variable. In our formalism, these are imple- 
mented by multiplying the classical (diagonal) density 
matrix on the left and right by permutation matrices. 
(Permutation matrices are such that every element is zero 
or unity, and there is only one non-zero element in each 
row and column.) If we allow the observer to perform a 
deterministic transformation after the measurement, and 
conditional upon the outcome, then we have 



Pn Pn pPn Pn 



(3) 



When we augment classical measurement with classical 
deterministic transformations, one could refer to the re- 
sult as a classical operation. While Eq.Q is sufficient 
for our purposes, it does not give the most general form 
for a classical operation. To do this one would include, 
in addition to the above, a stochastic map describing an 



irreversible randomization of the system. We note that a 
discussion of the relationship between quantum and clas- 
sical measurements, using a different formalism, is given 
in [13. 

There is now a close similarity between the above ex- 
pression for classical operations, and the expression for 
quantum operations given in Eq.l^. In fact, this sim- 
ilarity is not just superficial; there is a sense in which 
we can think of the quantum positive operators Q„ as 
characterizing the unadorned act of measurement, as the 
Pn do in classical operations. To see this we consider the 
von Neumann entropy of the average state of the sys- 
tem after the measurement. Classically, if we allow only 
measurement, we have 



^PnPn ^ PnpPn = ^ PnP = 



(4) 



so that the entropy of the final average state is the same 
as the entropy of the initial state. Denoting the difference 
between the initial and final entropy as 



Do{p) = S 



S[pi 



(5) 



(a notation which will be motivated later) then for clas- 
sical measurements we have Do — 0. However, if we al- 
low conditional deterministic transformations (feedback), 
then it becomes possible to arrange that Do < 0. That 
is, feedback allows us to reduce the final average entropy 
below the initial entropy, which is the essence of feedback 
control. 

A similar result exists for quantum measurements, 
which is due to a theorem of Ando ■ Ando's theorem 
states that for quantum measurements in which all the 
operators are positive ($!„ — Qn, Vrt), Do{p) > 0. In ad- 
dition, when one allows the use of deterministic feedback 
(unitary operators), as in the classical case one can ar- 
range that Do{p) < 0. This therefore provides an initial 
quantitative motivation for thinking of the positive oper- 
ators which appear in the polar decomposition for quan- 
tum operations as the equivalent of the measurement op- 
erators in classical operations. Moreover, we expect that 
other relations exist in which quantum operations with 
positive operators parallel classical measurements, and 
we give a conjecture regarding a second example of such 
a relation in Section Hvl 

In view of the above argument we will refer to quantum 
measurements in which all the measurement operators 
are positive as bare measurements. An important feature 
of bare measurements is that, due to the Polar decompo- 
sition theorem, if one is interested purely in the process of 
acquiring information, it is enough to study these alone, 
since post-measurement unitary transformations do not 
affect this process. We note that such measurements have 
been referred to previously by various authors as, pure 
measurements [l4| , measurements without feedback |l5j , 
and square root measurements |16|| . 
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III. OPERATIONS ON MEASUREMENTS 

It is useful at this point to introduce a few definitions. 
In what follows we will denote measurements by calli- 
graphic letters. If we write, 

M = {r!„:n=l,...,M} (6) 

we will mean that A4 denotes the measurement described 
by the operators {i7„}. The operators {fin} are often 
referred to as the Kraus operators 0, although in what 
follows we will refer to them simply as the measurement 
operators. We will follow Kraus's terminology and refer 
to the operators i?„ — ill^fln as the effects. 

There are at least two ways in which a physical proce- 
dure may be used to combine two measurements together 
to form a new measurement. One we will refer to as mix- 
ing |l7j |. and the other as concatenation. 

Mixing: In this case the observer chooses between two 
measurements, Ai = {f>-m ■ ™ — li • ■ • i and Af = 
{51^ : n = 1, . . . , N} on a random basis, choosing to use 
M with probability p, and the other with probability 
1 — p. The resulting measurement is given by 

Q ^ {V^in^}lj{V^^n} (7) 

= pM + il- p)N. (8) 

This is a kind of 'addition' operation for measurements. 

Concatenation: In this case the observer performs two 
measurements, one after the other. If M is made first and 
second, then this is equivalent to making the single 
measurement Q, where 

S = ■.n=l,...,N-m^l,...,M} (9) 

= MoAf. (10) 

One might think of this as a kind of 'multiplication' op- 
eration for measurements. 

IV. TWO INFORMATION-THEORETIC 
CAPACITIES 

We are interested here in quantifying the amount of in- 
formation which a measurement provides about the sys- 
tem being measured. In classical terms one might think 
of concepts such as the 'accuracy' of a measurement, or 
the 'resolving power'. It is worth noting that, in gen- 
eral, the amount of information that the measurement 
provides depends, in a way we will make precise later, 
on the state of the system being measured. However, in 
what follows we will be specifically interested in quanti- 
ties which depend only on the measurement itself. Such 
quantities can be obtained by optimizing over the mea- 
sured state, or simply by fixing it, if a natural choice 
exists. 

The first thing to consider when speaking about the 
information obtained in a measurement is what this in- 
formation is about. For quantum systems, there are two 



choices, since one can consider the information provided 
about the initial preparation, or the information pro- 
vided about the state which results from measurement. 
In classical measurements there is only one choice, be- 
cause these two things are the same. 

One kind of information, that about the initial prepa- 
ration, is useful in communicating classical information. 
If Alice encodes a message in a quantum system, then the 
ability of a measurement to extract information about 
the initial preparation tells us how much classical infor- 
mation Bob can obtain from Alice by employing the mea- 
surement. Thus, a measure of this kind of information is 
a classical capacity. 

Alternatively, one can ask how much information one 
obtains about the final state - that is, the state that we 
are left with once we have made the measurement. How 
much one knows about a quantum state can be character- 
ized by the von Neumann entropy of this state. Thus, a 
measure of the amount of information which the measure- 
ment extracts about a state can be obtained by taking 
the average decrease in the von Neumann Entropy result- 
ing from the measurement. This motivates the definition 
of the purification capacity (and the related concept of 
the measurement strength [l^ ITsjl'). We now deal with 
each of these measures of information in turn. 



A. Classical Capacity 

There is now a considerable body of work on the ca- 
pacity of quantum channels for transmitting classical in- 
formation (see, e.g. 0,Iil,|2O|,|2ll|2a,|2iiii23). Here, 
however, we are interested in examining the capacity of 
a measurement to obtain classical information encoded 
in a quantum state. In defining the capacity of a chan- 
nel, one must maximize over all measurement strategies 
and encodings, in order to find the amount of information 
that can be communicated by the channel. To obtain the 
capacity of a measurement, instead one assumes a per- 
fect channel, fixes the measurement and optimizes over 
all possible encodings, thus providing a quantity which 
characterizes the measurement rather than the channel. 

In contrast to classical channels j2^, in considering 
the classical capacity for quantum channels, there is a 
distinction between the 'single shot' capacity and the 
asymptotic capacity. The single shot capacity tells us 
how much classical information can be transmitted by a 
single use of the channel. If the channel is a single qubit 
being transmitted from sender to receiver, for example, 
then the single shot capacity tells us how much informa- 
tion can be obtained by measuring just the one qubit. 
The asymptotic capacity is the amount of information 
that can be transmitted, per qubit, if we are allowed 
to measure multiple qubits, in the limit of an infinity 
number of qubits. In general, the asyrnptotic capacity is 
greater than the single shot capacity [2^ l28l 129^ . How- 
ever, King and Ruskai 30] have recently shown that this 
is only true if joint measurements are made across multi- 
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pie uses of the channel. Thus, as a property of a measure- 
ment on a single system, the notion of classical capacity is 
well defined without reference to the number of systems 
being measured. 

The amount of classical information that is provided 
by a measurement, A4, about an encoding, e, and which 
we will denote by AIi{M^e), is the mutual information 
between the encoding and the output results. An encod- 
ing is the set of states chosen as the alphabet with which 
to write the information, along with the probability with 
which each state will be selected. We will denote an el- 
ement of the set of all possible encodings by e, and the 
set of states and probabilities corresponding to this en- 
coding as {pi : i — and {P(i) : i = 
respectively. The state of the prepared system from the 
point of view of the observer making the measurement is 
p — J2i P{^)Pi- With these definitions, the information 
provided by the measurement is given by 

AhiM,e) = H[P{i)] - QU)H[Pm], (11) 



where 



Q{j) = Tr[n]n,p] 



(12) 

is the a priori probability density of the output results, 



TT[n]njp,]p{i) 



j:^TT[n]n^p,]p{i) E.Q(jW^(») 



(13) 



is the observer's state of knowledge of the initial prepa- 
ration on receiving the outcome j, and H is the entropy. 
The quantity Q{j\i) introduced above is the probability 
density of the output results given that the initial state 
is Pi. We will continue to use this notation throughout; 
that is, we will denote the probability densities of the 
initial states by P, and the densities of outcomes by Q. 
Note that the mutual information is simply the average 
reduction in the entropy of the distribution of the initial 
preparation, P(i), due to the measurement. It is useful 
to note that the mutual information can also be written 
in the reverse form with input i and output j swapped. 
That is 



AI,{M,e) = H[Q{j)] - PmiQiM]- 



(14) 



Naturally the classical capacity of a measurement should 
be defined as the supremum of A/i {A4 , e) over all initial 
encoding, and is therefore given by 



This is because, since the observer chooses between mea- 
surements M and A/" on a random basis, the preparer can- 
not know which measurement will be used, and therefore 
must use the same encoding irrespective of the measure- 
ment made. If the optimal encoding for both measure- 
ments is the same, then we have equality. Otherwise one 
measurement necessarily extracts less information than 
is optimal, hence the inequality. 

The behavior under concatenation is straightforward 
to obtain for classical measurements. If we denote the 
mutual information between two sets of random variables 
X and Y as M{X, Y), then the capacity of the concate- 
nated measurement is the supremum of M(i\j,k) over 
initial encodings, where i labels the elements of the en- 
coding, and j and k label the outcomes for the respective 
measurements M and J\f. Rearranging the expression for 
M{i\j, fc), one finds that 

M{t\j, k) = M{i\j) + M{i\k) - Mij\k). (17) 

The point here is that, while two measurements may be 
independent, the outcomes of the measurements, when 
made in sequence on the same system, are not, in general, 
uncorrelated. Since AI{j\k) > 0, we have M{i\j,k) < 
M{i\j) + M{i\k). Since the classical capacity of A4 and 
Af are the respective supremums of M{i\j) and AI{i\k) 
over initial encodings, we have 

C{MoM) = sup M{i\j,k) 

e 

< sup[M{i\j) + M{i\k)] 

e' 

< supAf(i|j) -|-supM(i|fc)] 

e' e" 

= C{M) + C{N). (18) 

The first inequality is only saturated when M{j\k) ~ 
and the second is only saturated when the optimal en- 
coding is the same for both measurements. 

For quantum measurements at first it is tempting to 
suggest that C might also be subadditive. However, if we 
allow all classes of measurements, in particular measure- 
ments in which the operators f2„ are not purely positive, 
but have a unitary component, then it is not hard to find 
examples which break the inequality. That is. 



3M,Af , C{MoAf)> C{M) + C{Af). 



(19) 



C(7W) = supA/i(7W,£). 



(15) 



Nevertheless, it is important to point out that the same 
can be said for the mutual information for classical oper- 
ations if we allow conditional deterministic transforma- 
tions. As a result we are prepared to conjecture that for 
hare measurements the purification capacity satisfies the 
classical relation, Ea. H18|l . 



It is natural now to ask how C behaves under the op- 
erations of mixing and concatenation. For mixing it is 
clear that 

C{pM + (1 - p)N) < pC{M) + (1 - p)C{Af). (16) 



B. Purification Capacity 

While the classical capacity was motivated by consid- 
ering the transmission of classical information, the purifi- 
cation capacity is motivated by considering the problem 
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of state preparation and more generally quantum feed- 
back control [13. ITU. In this case, instead of being in- 
terested in the amount of information one obtains about 
the initial preparation, one is interested in how well one 
knows the final state that results from the measurement. 
This is because the more pure the final state, the more 
one can subsequently control the system. That is, one 
is interested in the amount by which, on average, the 
measurement purifies the state of the system. 

For the purposes of control, the von Neumann entropy 
is a sensible measure of the observer's uncertainty of a 
state, since it measures the minimum possible entropy 
of the outcomes of a future measurement. That is, if the 
observer has the ability to perform any measurement, the 
von Neumann entropy measures the unavoidable residual 
unpredictability of the measurement result. The outcome 
of a measurement on a state with zero von Neumann 
entropy can, in the best case, be predicted perfectly. 

To characterize the amount of information that a mea- 
surement provides about the final state one can therefore 
average the von Neumann entropies of all the possible fi- 
nal states over the measurement outcomes. Subtracting 
this from the von Neumann entropy of the initial state 
gives us a measure of the increase in the observer's in- 
formation about the state of the system provided by the 
measurement: 

AIfiM,p) = S{p)-J2mSiPr), (20) 

i 

where A4 is the measurement and p is the initial state. 
Nielsen has shown t hat, as one would intuitively expect, 
Alf is non-negative [31|. (See also ,151] for a remarkably 
simple proof of the this result discovered by Fuchs.) 

In order to obtain a quantity which depends only on 
the measurement, two possibilities are to fix the initial 
state, or to maximize Alf over all possible initial states. 
If we choose the second option, then we have a quantity 
which is the final-state equivalent of the classical capac- 
ity. We will refer to this as the purification capacity, and 
denote it by K. Thus we define 

K{M) ^siipAIf{M,p). (21) 

p 

If instead we wish to fix the initial state, which was 
the procedure adopted in then it is important to 
choose a state which is invariant under unitary trans- 
formations, so that the resulting quantity, which was re- 
ferred to in |l4j as the measurement strength, satisfies the 
intuitive notion that it should be invariant under unit ary 
transformations of the measurement. In Refs. 
the measurement strength, which we will denote by Kj, 
was defined by fixing the initial state to be the maximal 
uncertainty state p = I/N. This gives 

Ki{M) = HN) -Y,PnS[En], (22) 

n 

where En — En/Tr[En], and we have used the fact that 
the eigenvalues of ^n^n the same as those of En in 
Hi. By the definition of K we have Ki[M) < K{M). 



It is now natural to ask how the purification capacity 
and the measurement strength behave under the oper- 
ations of mixing and concatenation. Under mixing one 
has trivially that Kj is linear, ie. 

Ki{pM + {l-p)M) = pKiiM) + {l-p)Ki{M), (23) 

and that K is concave: 

K{pM + {l~p)N) < pKiiM) + {l-p)Ki{N). (24) 

While it is not immediately obvious, for bare measure- 
ments Ki is invariant under a change in the order in 
which measurements are concatenated. That is, 

Ki{MoN) = Ki{NoM) , M,MeB, (25) 

where B denotes the set of all bare measurements. This 
follows almost immediately from the fact that, for any 
two operators A and B, AB has the same eigenvalues as 
BA Ullsl. 

For general quantum measurements, it is not hard to 
show that neither the measurement strength nor the pu- 
rification capacity satisfy the classical relation, given by 
Ea. (|18|l (Note that for classical measurements K = C .) 
However, we conjecture that for bare measurements both 
the strength and purification capacity satisfy the classical 
relation, e.g. 

K{MoM)<K{M) + K{N) , M,AfeB. (26) 

V. TWO MEASURES OF DISTURBANCE 

Although disturbance is not the primary focus of this 
article, we feel that it is worth pointing out here that, 
corresponding to the two measures of information dis- 
cussed in the previous section, there are two natural 
information- theoretic measures of the disturbance caused 
by a measurement. The first, which we might refer to as 
disturbance to the input, measures the amount by which 
classical information encoded in a quantum state is de- 
graded by a measurement. That is, it tells how much 
less classical information we can extract from the sys- 
tem about the initial preparation after having made the 
measurement. For this purpose we will assume that the 
observer who wishes to extract the information does not 
have access to the results of the disturbing measurement. 
The reason for using this definition is that it results in 
a sharp contrast between classical and quantum mea- 
surements; for classical measurements this disturbance 
is zero, but can be non-zero for quantum measurements. 

The second kind of disturbance, which we might re- 
fer to as disturbance to the output, is the difference be- 
tween the entropy of the initial state, and the entropy 
of the state given by averaging all the final states. This 
measures the amount of noise that the measurement is 
feeding into the system. This is understood most easily 
by thinking of a noise-driven classical system. The ran- 
dom changes that the quantum system experiences as a 
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result of the random outcomes of the measurement are 
hke the random kicks experienced by a classical system 
under the influence of a noisy force. The strength of this 
noise can be characterized by the entropy increase of the 
probability distribution for the state of the system given 
that the observer has no knowledge of which kick will oc- 
cur. This quantum measure of disturbance is simply the 
equivalent calculation for a quantum system. This kind 
of disturbance is relevant for quantum feedback control, 
and was considered in 0, ^ . The disturbance caused 
by a classical measurement under this definition is zero, 
whereas, for bare quantum measurements, it is greater 
than or equal to zero Thus, both measures provide 
a precise concept of the general notion that quantum 
measurements can produce a disturbance [s^ . whereas 
classical measurements do not. 

Note that the definitions of disturbance which have 
been most studied in the literature to date, for exarnple 
in the work by Fuchs 34] , Banaszek js^l and Barnum 0| , 
characterize disturbance to the input. It is therefore our 
first measure which is most closely related to these no- 
tions of disturbance. 



A. Disturbance to the Input 

Given an encoding, e, the optimal amount of informa- 
tion which an observer Alice can obtain about a message 
encoded by Bob u sing that encoding is called the acces- 
sible information p3a |. If a third observer. Eve, makes 
a measurement, on the system in which the infor- 
mation is encoded, and does not relay the result of the 
measurement to the Alice, then from the point of view of 
the first observer the state of the system is transformed 
by 

p^p' = Y,^npnl (27) 

n 

where the fin are the measurement operators for A4. As 
a result, the encoding is transformed from s to e' where 

e' = {P^, J2 ^nP^^i I I = 1 , • • • , iV} . (28) 

n 

The disturbance caused by the measurement M to the 
ensemble e can then be characterized by the difference 
between the accessible information of e and that of e', 
which we can write as 

-^i(^) — -^access (^) -^access (^ )■ (^'^) 

To obtain a measure of disturbance which characterizes 
the measurement alone, we can do one of at least two 
things. The first is simply to choose e to the be the 
optimal encoding for a perfect channel, which, for an A^- 
dimensional system, is an N- dimensional orthonormal 
basis. In this case the first term is simply the capacity of 
the perfect A^-dimensional quantum channel. The second 



term, however, is not the maximal amount of information 
which can be transmitted from Alice to Bob given that 
Eve makes the measurement Ai, since the optimal en- 
coding e for the perfect channel, may not be the optimal 
encoding for the channel described by the measurement 
operators Qn- Thus, a second approach to characterizing 
the disturbance of the measurement is to choose the sec- 
ond term in the expression for -Di(e) to be the accessible 
information when e' is such as to make this optimal. The 
result is simply the single shot capacity of the channel de- 
scribed by the measurement operators for Ai. Thus, the 
input disturbance is essentially a channel capacity. If we 
write the capacity of the perfect channel as Ccap (2^) , and 
that of the channel described by M as CcapiM), then 
the input disturbance of M is 

A = Ceap(2:)-Ceap(X). (30) 

For classical measurements on classical systems Z^i = 0. 
For quantum measurements Di > 0. To see that Di is 
non-negative one merely needs to note that the accessi- 
ble information is a quantity which is maximized over all 
measurements. As a result, the measurement M made 
by Eve can be included as part of Alice's possible strate- 
gies for obtaining information in the case of the perfect 
channel. Thus, the case in which Eve makes her mea- 
surement is just a special case of the perfect channel in 
which Alice, as part of her strategy, first makes Eve's 
measurement and then throws away the information be- 
fore making any further measurements. Therefore, the 
capacity of the channel described by A4 cannot be more 
than that of the perfect channel, and hence the distur- 
bance Di is a non-negative quantity. 



B. Disturbance to the Output 

The above measure of disturbance is concerned with 
the reduction in the ability of an observer to obtain infor- 
mation about an initial preparation, given that another 
observer has made a measurement on the system to which 
the first observer has no access. Now we consider the re- 
duction in an observer's knowledge of the final state of 
the system, given that a second observer has made a mea- 
surement to which the first has no access. If the initial 
state is p, then the final state of the system after a mea- 
surement M, from the point of view of the first observer, 
is 

p'^Y^^nPnl (31) 

n 

The reduction in the observer's knowledge of the state of 
the system, p, caused by ^A , may be characterized by the 
increase in the von Neumann entropy of p' over p. That 
is, 

D,{p)^S[p']~S[p]. (32) 
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This quantity is zero for classical measurements, and 
Ando has shown that it is non-negative for all bare mea- 
surements 13 • 

No doubt there is more than one way to use the above 
measure to obtain a quantity which characterizes the 
measurement alone, and it may well be that different 
choices may be motivated by different applications. Nev- 
ertheless, one reasonable procedure is to minimize Do(p) 
over all initial states. That is, to define the output dis- 
turbance of a measurement as 



inf S 
p 



S[P\ 



(33) 



This measures the least disturbance that can be achieved 
with the given measurement if one is free to select the 
initial state. Under this definition one has the desirable 
result that all commutative measurements have Do = 0. 



VI. INFORMATION GATHERING AND 
STATE-SPACE SYMMETRY 

A. Classical Measurements 

Since classical measurements have already been intro- 
duced in Section m we will merely pause to clarify our 
definitions: By a classical measurement, we will mean a 
measurement which can be performed on a classical sys- 
tem. Thus a classical measurement is not only one in 
which all the measurement operators commute, but also 
one in which is it understood that the density operator 
is also required to be diagonal in the same basis as these 
operators. We will refer to measurements in which the 
only restriction is that the operators commute as com- 
mutative quantum measurements. The reason for this 
distinction is that it is possible to enhance the proper- 
ties of measurements with unitary transformations |37j . 
Since such transformations require transforming the den- 
sity matrix so that it is diagonal in other bases, one must 
make a distinction between classical measurements and 
commutative measurements; the latter can be enhanced 
in ways in which the former cannot. 

For classical measurements, there is no difference be- 
tween the information about the initial encoding, and 
that about the state which exists after the measurement, 
because the measurement does not interfere with the ini- 
tial preparation; both A Ji {M , e) and A// {M , p) reduce 
to the classical mutual information, which we will write 
simply as AI{A4, p). Thus K ~ C, and the measurement 
strength Kj is the mutual information between the out- 
put results and the initial preparation when the encoding 
is the uniform ensemble over the all the available initial 
states. 

Now, the mutual information is the average decrease in 
the Shannon entropy of the observer's state of knowledge 
as a result of the measurement, and depends, in general 
upon not only the measurement but also the initial state. 



How exactly, does it depend on the initial state? Since 
the entropy of the state cannot decrease below zero, one 
might suggest that the information gain in the measure- 
ment decreases as the uncertainty in the observer's initial 
state of knowledge decreases: the more you know the less 
you find out. However, from the fact that the mutual in- 
formation is not always maximized when the ensemble 
has maximal entropy, we know that it is possible for the 
reverse to be true for at least some measurements and 
some states. So is there a sense in which a reduction in 
information gain with increasing knowledge is a funda- 
mental property of measurement? 

To answer this question, we consider a class of measure- 
ments which we will refer to as permutation- symmetric 
measurements. We define these measurements as those 
which are symmetric under the permutation of two clas- 
sical basis states. That is, when any two states |i) and 
\j) are swapped, the set of measurement operators re- 
mains unchanged — the various measurement operators 
merely interchange among themselves. The motivation 
for such a definition is the observation that the informa- 
tion extraction capability (or accuracy, or resolution) of a 
measurement can, in general, vary across the state space, 
and it is this that causes some measurements to provide 
a larger average amount of information when applied to 
states with less that maximal entropy. The definition of 
permutation-symmetric measurements ensures that they 
have the same resolving power over all of state space. 
We now show that these measurements have the prop- 
erty which we seek. First it is helpful to introduce some 
notation. We will denote the vector of eigenvalues of a 
matrix A by X{A). If we write p ^ a then we will mean 
that X{p) ^ X{a) 

Theorem: If A4 is classical and permutation- 
symmetric, then for any two classical states p and cr, 
p ^ a implies that AI{M,p) > AI{M,a). (Note that 
a more concise way of saying this is that AI{Ai,p) is 
Schur- concave in p 

Proof: To begin we note that given a diagonal oper- 
ator n of dimension N, the sum of the A^! operators, 
{r2„i} which are the different permutations of f2, is pro- 
portional to the identity. This means that the operators 
{ailm} form a valid measurement for some a. In what 
follows when we refer to an operator Q which is used to 
generate a permutation-symmetric measurement by tak- 
ing all its permutations, we will always define Q so that 
Tr[£;] = Tr[f72] = 1, in which case a = l/^/{N - 1)!. We 
will refer to a measurement generated by a single opera- 
tor as an irreducible permutation-symmetric measure- 
ment (IPM). Clearly all permutation-symmetric mea- 
surements can be written as mixtures (in the sense of 
the mixing operation described in section llTT|l of IPM's. 
Now consider the mutual information for an IPM. This 
may be written as 

A/(M, p) - H\Qp(rn)\ ~ ^ Pp(n)i?(Q,(m|n)), (34) 
where Qp{m) is the distribution of the measurement out- 
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comes, Pp{n) is the distribution of the initial states (be- 
ing the diagonal of p), and Qp{m\n) is the distribution 
of measurement outcomes given that the initial state is 
n. The first thing to note is that due to the permutation 
symmetry, H{Qp(m\n)) is the same for all m. Thus, the 
second term is independent of the initial state p, and we 
need merely show that 

H[Qp{n)]> H[Q„{n)] when p ^ a. (35) 

For this it is sufficient to show that Qp -< Q„ if p ^ 
cr. That is, that the operation that transforms p to Qp 
preserves majorization. The operation that transforms p 
to Qp may be written as 

Qp = Ap^, (36) 

where p„ is the vector consisting of the diagonal of p, 
and A is a matrix whose rows are the diagonals of the 
operators VirrMln- The rows of A therefore consist of all 
the permutations of any given row. At this point we 
can invoke a theorem of Chong j who has shown 

that the operation of multiplication by a matrix preserves 
majorization if the matrix is such that the rows form a 
permutation invariant set. That is, if all permutations 
of any row of the matrix are also rows of the matrix. 
We will refer to matrices of this form in what follows as 
Chong matrices. This proves the result for IPM's. Since 
every permutation-symmetric measurement is a mixture 
of IPM's, and A/ is linear under mixing, the result holds 
for all permutation-symmetric measurements. □ 

That is, once we have eliminated the state-space de- 
pendence of the resolution of measurements, a fundamen- 
tal property remains. This is that, the more one knows, 
the less is the amount that one will learn by applying a 
given measurement. 

As a corollary of this, we have that the classical capac- 
ity of a classical irreducible permutation-symmetric mea- 
surement is attained by the uniform distribution, and is 
therefore given by 

C = \nN-S{E), (37) 

where S is the von Neumann entropy, and E = fl^. 
In addition, it is worth noting that since both classi- 
cal channels and classical measurements are defined by 
a complete set of conditional probabilities, classical mea- 
surements and classical channels are one and the same 
thing. Thus, Ea. l|^ also gives the capacity of irreducible 
permutation-symmetric classical channels. The capacity 
of a general permutation-symmetric measurement 

N 

X-^p„AA„, (38) 

Tl=l 

where each of the measurements A/'„ is generated by the 
operator En is thus 

N 

C^\nN-J2PnS{En). (39) 

ra=l 



B. Unitarily Covariant Measurements 

In the previous subsection we found a class of classi- 
cal measurements for which the information provided by 
the measurement always increased with the initial uncer- 
tainty. In this section we turn our attention to quantum 
measurements. In this case, it is the Unitarily Covari- 
ant Measurements (UCM's), to be defined below, which 
are the equivalent of the classical permutation-symmetric 
measurements, in that they are invariant under all re- 
versible transformations of the state-space. However, 
in considering the information gathering properties of 
UCM's we need to be a little more precise about what 
we mean by 'information about the initial state'. 

In the previous section we assumed that the informa- 
tion was always encoded in individual (or pure) states, 
rather than probability densities of states (or mixtures). 
What this assumption meant is that Ali , which measures 
the information provided about the encoded message, 
also measured the information obtained about which 
state the system was initially in. Note that, since clas- 
sical systems are always in some 'pure' state, it is not 
ambiguous to ask what state the system is in, even if the 
initial encoding is in mixed states. However, if the initial 
encoding does use mixed states, Ali no longer measures 
the information that the measurement provides about the 
what state the system is in. 

When we consider obtaining a quantum equivalent of 
the theorem of the previous section, then we need to use 
the equivalent notion of information - that is, informa- 
tion about what pure state the system is in. Now, in 
quantum mechanics, in general the system does not have 
to be in any pure state, since an alternative exists - it 
could be entangled with another system. Thus, we need 
to be clear that the situation we are concerned with here 
is that in which the system is known to be in some pure 
state. The information we are concerned with is the in- 
formation provided about what that state is, and this is 
what AIi{A4, e) measures so long as e is an ensemble of 
pure states. In what follows our analysis will therefore 
be restricted to ensembles of pure states. 

To begin, the unitarily covariant measurements ^HE3 
are defined as being those measurements which are in- 
variant under every unitary transformation of the mea- 
surement operators; that is, when all the measurement 
operators are transformed by a given unitary, all the mea- 
surement operators merely transform among themselves. 
Thus, every UCM has a continuum of measurement op- 
erators, labeled by the unitary transforms U. 

Now, we wish to show that the UCM's are a class of 
quantum measurements for which the information gain 
about the initial pure state ensemble, AJi(A^,e), al- 
ways increases with the initial uncertainty. Now we must 
ask, 'the initial uncertainty of what?'. Since, in general, 
AIi{Ai, e) is a function of the initial ensemble, one might 
expect that we would have to consider some uncertainty 
property of the ensemble, rather than the initial state, p. 
However, it turns out that this is not the case: 
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Theorem: The information A/i {Ai , e) , obtained about 
a pure-state ensemble e by a unitarily covariant measure- 
ment M , depends on the ensemble only through the den- 
sity matrix p = I], where e = {P{i), IV'i)}- 

Proof: The expression for the information gain is 

AI{M,e) = H[Qp{U)] 

- J2Pp{\^JJm))H{Qp{U\\^Prn))). (40) 
rn 

Since Ai is unitarily covariant, H{Qp{U\\ipm))) is the 
same for all initial states |V'm): a-nd therefore the second 
term is the same for all initial ensembles. As a result, 
AI{M,p) depends only on the first term, which depends 
only on p. □ 

In view of the above result, we now show that under 
unitarily covariant measurements p(£)) is 

Shur-concave in p. 

Theorem: If A4 is unitarily covariant, then for any 
two states p and a, p < a implies that A/i(A^,p(e)) > 
A/i(A^, (T(e')), where e and e' are ensembles of pure 
states. 

Proof: First we note that given a positive operator 
we have 



j UEUUp{U) = Tt[E]I, 



(41) 



where E — H.^ and dp{U) is the (unitarily covariant) 
Haar measure over unitary operators U |4C|. Thus, to 
generate a covariant measurement from an operator 17, 
we first scale Q. so that Tr[£^] = 1. We will refer to a UC 
measurement generated from a single operator 57 as an 
irreducible UC measurement. Now, Ea. l|lT)l also tells us 
that for any covariant POVM, each subset of the mea- 
surement operators which is closed under unitary trans- 
formations form themselves a POVM. As a result we can 
restrict ourselves to measurements in which all the mea- 
surement operators are obtained from each other by a 
unitary transform; all other unitarily covariant measure- 
ments can be obtained from measurements of that form 
by mixing. 

Now consider the information gain 

A/(A^,p) - H[Qp{U)] 

Now, since the measurement is unitarily covariant, 
H{Qp{U\\^m))) is independent of the state |V'm)- As 
a result, the second term in the expression for A/ is in- 
dependent of the ensemble probabilities Pp{\'ipm))i and 
therefore of the initial state p. Thus we need merely con- 
sider the first term, H[Qp{U)]. Now since 

H[Qpm = [ fiTT[UEU^p])dp{U) (43) 



(where f{x) = —xlnx), H is invariant under a unitary 
transformation of p, and as a result we can always choose 



the eigenvectors of p to be the same as the eigenvectors of 
E. Denoting these eigenvectors by {|j)}, and the eigen- 
values of p and E as {A^} and {Ei}, respectively, we have 



Qp{U) = Tr[UEU\Y.X,\j){j\)] 



E 



Y.E,M,,{U) 



(44) 



where Mij{U) = \Uij\'^ is a doubly stochastic matrix. 

The quantity in the square brackets has two indices: 
the column index, j, is discrete and the 'row' index, U, 
is continuous. The sum over j is a discrete-to-continuous 
version of matrix multiplication. This continuous nature 
will make the following discussion more involved than 
that in previous section, but the complication is primarily 
technical. 

First we make the problem discrete by choosing a fi- 
nite set of points on which to sample the continuous set 
of unitary transformations U. Let us denote the set of 
points as {Uk : k ^ I, . . . , L}, and the sampled values of 
Q at those points, as Qp{Uk)- The set of these values is 
now a vector with L elements. The discrete transforma- 
tion may now be written as 



QpiUk) = 



(45) 



Now, for every sample point Uk, we have a doubly 
stochastic matrix, Mij{Uk)- The fact that the continuum 
transformation includes every value of U (that is, every 
unitary transformation), means that, given any sampling 
{Uk '■ k — 1, . . . ,L}, we can extend this sampling to in- 
clude every column permutation of every one of the ma- 
trices Mij{Uk)- (By a column permutation we mean an 
operation in which one or more of the columns of M are 
swapped.) That is, we can choose a new set, {Ui :}, 
such that it contains {Uk '■ k — 1,...,L} as a subset, 
and the set {Mij{Ui) : I — 1,...,L'} is such that for 
every element M, every column permutation is also an 
element. The transformation for this extended set is now 
majorization preserving, since 



QpiUi) - E 



Y,E^M,,{Ui) 



A, 



(46) 
(47) 



and Mij is a Chong matrix. This is because the opera- 
tion of summing over the index i in the first line above 
simply involves summing each of the columns of each 
stochastic matrix Mij, to create one row of the matrix 
Mij. Since every column permutation of each Mij exists, 
the resulting matrix, Mij , is such that every permutation 
of each row of Mij appears as another row of Mij , which 
is Chong's condition. 
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Our goal is naturally to take the limit as L tends to 
infinity, to obtain the continuum result that we want. 
However, before we do so we must take into account the 
measure. In calculating the entropy of the density Qp{U), 
we integrate over the Haar measure. A discrete approxi- 
mation of this integral for our vector Qp{Ui) is 



L' 



(48) 



1=1 



where as before f{x) — —xlogx, and A(Ui) is the Haar 
volume associated with each sample point Ui. It is impor- 
tant for our purposes that in the above equation we asso- 
ciate the same Haar volume with all sample points which 
are obtained from each other by a permutation. (We will 
refer to the L sets containing A''! points, all permutations 
of each other, as the L permutation-invariant sets.) To 
see that this is possible, for any value of L, one first notes 
that the group of permutations forms a finite subgroup 
(containing iV! elements) of the group of all unitary trans- 
formations. As such, it can be used to divide the set of 
all unitaries up into A^! subsets, where each is the image 
of the others under the action of a permutation. Since 
the Haar measure is invariant under unitary transforma- 
tions, each of these subsets has not only the same Haar 
volume, but any region defined within one has the same 
volume when that region is mapped to another. Because 
of this last property, we can do the following: To ensure 
that all the points within a given permutation-invariant 
set have the same associated Haar volume, we can (for 
example) chose our original L points all to lie within one 
of the A^! factorial subsets, choosing the Haar volumes 
associated with each in whatever way we see fit. When 
we extend the set of points to include all permutations, 
the regions associated with the points in the first subset 
are mapped to each of the other A^! — 1 subsets. Each 
of the new points thus has the same Haar volume as the 
original point of which it is the image, which means that 
all the points in a given permutation-invariant set have 
an associated region with the same Haar volume. 

Thus, we can break the vector of points Ui into L sub- 
vectors, where the points in each are the points of one 
of the L permutation- invariant sets, and which conse- 
quently all have the same associated Haar volume. The 
importance of this is that each of these subvectors is ob- 
tained from the Aj's by a Chong matrix. We can now 
write the above summation as. 



H[Qp(.Ui)] - E ^('^«) 



. in 



(49) 



which is simply a weighted sum of the entropies of each of 
the subvectors. Since each of the subvectors is obtained 
from the vector X{p) by a transformation of Chong form, 
the entropy of each is Shur-concave in X{p), and hence 



a p ^ cr. Now, finally, we can take the limit _L — > oo, 
so that we recover the Haar integral, and H[Qp{Ui)] 
H[Qp{U)]. Since we know that Eg. lfSUI) is true for each 
L in the sequence, it is true in the limit, and we ob- 
tain our result for all ICM's. Since all unitarily covari- 
ant measurements can be obtained ICM's by mixing (i.e. 
averaging), the result follows for all unitarily covariant 
measurements. □ 

The following corollary is an immediate consequence: 
Corollary: The classical capacity of a unitarily covari- 
ant measurement is obtained by the uniform ensemble 
over any basis. 

In addition to the above result, we conjecture that the 
final information, AIf(M,p), is also Shur-concave in p 
for all unitarily covariant measurements. 



Classical Capacities for Symmetric and 
Covariant Measurements 



To obtain the classical capacity of a quantum mea- 
surement A4 one must optimize A/i(M,e) over all initial 
encodings. However, it turns out the complexity of this 
procedure is significantly reduced for commutative mea- 
surements. This is because, as we now show, the clas- 
sical capacity of a commutative quantum measurement 
is the same as that of the equivalent classical measure- 
ment. (A commutative and a classical measurement will 
be said to be equivalent when the measurement operators 
of the commutative measurement are, upon diagonaliza- 
tion, the same as those of the classical measurement.) 
This gives us the classical capacity of all Commutative 
Permutation-Symmetric (CPS) measurements in terms of 
the previous results for classical permutation-symmetric 
measurements. 

Theorem: The classical capacity of a commutative 
measurement is the same as that of the equivalent clas- 
sical measurement. 

Proof: Let M he a, commutative measurement, and 
let us denote the basis in which the measurement oper- 
ators of A4 are diagonal as {|«)}. Note that when we 
use the states {\i)} to encode information, the behav- 
ior of the commutative measurement is exactly the same 
as the equivalent classical measurement. Consider now 
the conditional probability, Q(j| I?/")), for outcome j given 
that the initial state is the arbitrary state \tp) — Q^jN)- 
This is 

Q{j\m = Tr[E,\i;)m^^\a.\^Q{m). (51) 

i 

But this is precisely the same expression we would have 



obtained if we had used the mixture p — J2i 



as 



H[QpiUi)] > H[Q,iUi)] 



(50) 



the encoding state, instead of the pure state lip). Since 
the expression for the classical capacity can be written 
entirely in terms of the conditional probabilities Q{j\\ijji)) 
(along with the probabilities P{i)), this means that en- 
coding using any state which is not one of the eigenstates 
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|i) renders the same information as encoding using a mix- 
ture of the eigenstates. Since using a mixture is never 
better than using a single state, the capacity is achieved 
by encoding using the eigenstates. In this case the com- 
mutative measurement reduces to the equivalent classi- 
cal measurement, and their respective capacities are the 
same. □ 

Thus the classical capacity of CPS measurements is 
given by Eqs.ljSH) and 

It was shown in the previous section that an ensem- 
ble which achieves the classical capacity for all unitarily 
covariant measurements is the uniform distribution over 
any basis. Using this we calculate the resulting clas- 
sical capacity for irreducible UC measurements in Ap- 
pendix ^ which is 



C 



InN -Q{E) 



^ 1 



(52) 



fc=2 



where Q{E) is the subentropy of the operator E, as de- 
fined by Jozsa, Robb and Wootters [ij ■ The capacity in 
nats of a general UC measurement. 



N 



(53) 



where each of the irreducible UC measurements Mn is 
generated by the operator En, is thus 



N N 

c = in7v~^p„g(i;„)-^- 



(54) 



k=2 



The capacity in nats of the complete unitarily covariant 
measurements (of which there is only one for each di- 
mension TV), being a special case of the above formula, 
is 



Bare: Measurements in which every measurement op- 
erator is a positive operator. Alternative terms which 
have been used for these kind of measurements are vur e 
measurements 53 : measurements without feedback |l5j| , 
and square root measurements |16| . 

Classical: Measurements which can be performed on 
systems which lie within the domain of classical physics. 
These are measurements in which the all the measure- 
ment operators are positive, mutually commuting, and 
also commute with the density matrix describing the 
state being measured. 

Commutative: Measurements in which the all the 
measurement operators are mutually commuting. 

Complete: Measurements in which all the effects are 
proportional to rank-1 projectors |4l| . An alternative 
name for these measurements is maximal strength. 

Finite- Strength: Measurements in which all the mea- 
surement operators are of full rank ^15^|. 

Infinite- Strength: Measurements in which at least 
one of the measurement operators has at least one zero 
eigenvector. 

Incomplete: Measurements in which at least one of 
the effects is higher than rank one. 

Permutation- Symmetric: Measurements in which, 
if any two of the basis states are permuted, the measure- 
ment remains unchanged. That is, under a permutation 
of basis states, the measurement operators merely trans- 
form among themselves. 

Unitarily Covariant: Measurements in which, if a 
unitary transformation is applied to the measurement op- 
erators, the measurement remains unchanged. That is, 
under a unitary transformation, all the measurement op- 
erators transform among themselves. 

von Neumann: Measurements in which the all 
the measurement operators are commuting (orthogonal) 
rank-1 projectors. 



C 



^ 1 

^ h 



(55) 



fe=2 
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APPENDIX A: TERMINOLOGY 

The following names are used in the body of the paper 
to designate various classes of measurements: 



APPENDIX B: CLASSICAL CAPACITY OF 
UNITARILY COVARIANT MEASUREMENTS 

To calculate the classical capacity it is most conve- 
nient to use the form for the mutual information given in 
Ea. (|ll|l . For Unitarily covariant measurements the clas- 
sical capacity is attained by an ensemble consisting of the 
uniform distribution over any basis, so we have 



C{M) - sup i/[P(z)]-^Q(j)i/[P(z|j)] 

= \nN- J H[P{i\U)]d^i{U) 
= In TV - / H[NQ{U\\i))P{i)]d^l{U) 



= InN + N J Q{U\\iJ))\IlQ{U\\^))d^IiU) 
= liiN -NH[QiU\mi (Bl) 
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where we have used the fact that Q{U\\i)) is independent 
of and it should be noted that in the second to last 
line, \ip) simply represents any pure state, and dfi{U) is 
the unitarily covariant Haar measure over unitary trans- 
formations. To continue, 



N 



(B2) 



Thus the entropy of the conditional probability density 
is 



H[Qiu\\m 



N 



N 



J=l \k=l / 

N / N \ 

Y,E,\{m)\'\nij2EkMEk)\']df,m, 



(B3) 



where diJ,{\ip)) is the unitarily covariant measure over pure states Writing Pi — \ {ip\Ej)\'^, this becomes an integral 
over the uniform measure on the probability simplex 0,^3; being the volume defined by Pi < 1. That is 

NH[QiU\\m ^ -Nl I I ■■■ I ^i;,P,ln ^i;fcPj dPAr_i---dPi, (B4) 

j=i \k=i J 



"'0 "'0 



r 



where Pn = 1 — X]i=T^'' ^n- Evaluating this integral 
is non-trivial, but it has been solved by Jones ^^\, and 
two alternative methods are given by Jozsa, Robb and 
Wootters [o. The solution is 



NHlQiUWm 




(B5) 



fc=2 



where Q{E) is the subentropy of the operator E, as de- 
fined by Jozsa, Robb and Wootters. It might appear that 
this blows up when any of the eigenvalues of E are equal 
— however this is not the case; the value of iJ[Q([/| |?/;))] 
in the limit as £'„ — > Em, for any n and m, remains fi- 



nite ■41]. In fact, Q{E) < S{E),yE. Combining Ea. (|B5|) 
with Eq. (jBl|) , gives the capacity (in nats) of all covariant 
measurements generated from a single operator fl — \fE. 
It is shown in Rcf. 41] that the maximum value of the 
right hand side of Eq. (|B5|) is In A^, which is obtained when 
E — I /N . Thus the capacity of the covariant measure- 
ment generated hy Vl = I /\/N is zero as required. 

For complete measurements, Ei — 1 and Ej = for 
j > 1, and the integral in Ea. HB4|l reduces to 



-{N~l) / (1 



1 

N 



P)^-^P\n {P)dP 



Combining this with Eq. (|Bip gives the classical capacity 
(in nats) for the complete unitarily covariant measure- 
ments. 
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